Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Free, publicly-accessible full text available July 16, 2026
-
Abstract In various applications of multi-robotics in disaster response, warehouse management, and manufacturing, tasks that are known a priori and tasks added during run time need to be assigned efficiently and without conflicts to robots in the team. This multi-robot task allocation (MRTA) process presents itself as a combinatorial optimization (CO) problem that is usually challenging to be solved in meaningful timescales using typical (mixed)integer (non)linear programming tools. Building on a growing body of work in using graph reinforcement learning to learn search heuristics for such complex CO problems, this paper presents a new graph neural network architecture called the covariant attention mechanism (CAM). CAM can not only generalize but also scale to larger problems than that encountered in training, and handle dynamic tasks. This architecture combines the concept of covariant compositional networks used here to embed the local structures in task graphs, with a context module that encodes the robots’ states. The encoded information is passed onto a decoder designed using multi-head attention mechanism. When applied to a class of MRTA problems with time deadlines, robot ferry range constraints, and multi-trip settings, CAM surpasses a state-of-the-art graph learning approach based on the attention mechanism, as well as a feasible random-walk baseline across various generalizability and scalability tests. Performance of CAM is also found to be at par with a high-performing non-learning baseline called BiG-MRTA, while noting up to a 70-fold improvement in decision-making efficiency over this baseline.more » « less
-
Tether-nets deployed from a chaser spacecraft are a promising solution to capturing space debris. The success of the one-shot capture process depends on the net’s structural dynamic properties, attributed to its physical design, and on the ability to perform an optimal launch and closure subject to sensing and actuation uncertainties. Hence, this paper presents a reliability-based optimization framework to simultaneously optimize the net design and its launch and closing actions to minimize the system mass (case 1) or closing time (case 2) while preserving a specified probability of capture success. Success is assessed in terms of a capture quality index and the number of locked node pairs. Gaussian noise is used to model the uncertainties in the dynamics, state estimation, and actuation of the tether-net, which is propagated via Monte Carlo sampling. To account for uncertainties and ensure computational efficiency, given the cost of simulating the tether-net dynamics, Bayesian optimization is used to solve this problem. Optimization results show that the mission success rate in the presence of uncertainties has increased from 75% to over 98%, while the capture completion time has almost halved.more » « less
-
Abstract For a wide variety of envisioned humanitarian and commercial applications that involve a human user commanding a swarm of robotic systems, developing human-swarm interaction (HSI) principles and interfaces calls for systematic virtual environments to study such HSI implementations. Specifically, such studies are fundamental to achieving HSI that is operationally efficient and can facilitate trust calibration through the collection-use-modeling of cognitive information. However, there is a lack of such virtual environments, especially in the context of studying HSI in different operationally relevant contexts. Building on our previous work in swarm simulation and computer game-based HSI, this paper develops a comprehensive virtual environment to study HSI under varying swarm size, swarm compliance, and swarm-to-human feedback. This paper demonstrates how this simulation environment informs the development of an indoor physical (experimentation) environment to evaluate the human cognitive model. New approaches are presented to simulate physical assets based on physical experiment-based calibration and the effects that this presents on the human users. Key features of the simulation environment include medium fidelity simulation of large teams of small aerial and ground vehicles (based on the Pybullet engine), a graphical user interface to receive human command and provide feedback (from swarm assets) to human in the case of non-compliance with commands, and a lab-streaming layer to synchronize physiological data collection (e.g., related to brain activity and eye gaze) with swarm state and human commands.more » « less
-
The earth’s orbit is becoming increasingly crowded with debris that poses significant safety risks to the operation of existing and new spacecraft and satellites. The active tether-net system, which consists of a flexible net with maneuverable corner nodes, launched from a small autonomous spacecraft, is a promising solution to capturing and disposing of such space debris. The requirement of autonomous operation and the need to generalize over debris scenarios in terms of different rotational rates makes the capture process significantly challenging. The space debris could rotate about multiple axes, which along with sensing/estimation and actuation uncertainties, call for a robust, generalizable approach to guiding the net launch and flight – one that can guarantee robust capture. This paper proposes a decentralized actuation system combined with reinforcement learning based on prior work in designing and controlling this tether-net system. In this new system, four microsatellites with thrusters act as the corner nodes of the net, and can thus help control the flight of the net after launch. The microsatellites pull the net to complete the task of approaching and capturing the space debris. The proposed method uses a reinforcement learning framework that integrates a proximal policy optimization to find the optimal solution based on the dynamics simulation of the net and the MUs in Vortex Studio. The reinforcement learning framework finds the optimal trajectory that is both energy-efficient and ensures a desired level of capture qualitymore » « less
An official website of the United States government

Full Text Available